Hypergraph-Enhanced Textual-Visual Matching Network for Cross-Modal Remote Sensing Image Retrieval via Dynamic Hypergraph Learning
نویسندگان
چکیده
Cross-modal remote sensing (RS) image retrieval aims to retrieve RS images using other modalities (e.g., text) and vice versa. The relationship between objects in the is complex, i.e., distribution of multiple types uneven, which makes matching with query text inaccurate, then restricts performance retrieval. Previous methods generally focus on feature rarely model relationships features image. Hypergraph (hyperedge connecting vertices) an extended structure a regular graph has attracted extensive attention for its superiority representing high-order relationships. Inspired by advantages hypergraph, this work, hypergraph-enhanced textual-visual network (HyperMatch) proposed circumvent inaccurate text. Specifically, multiscale hypergraph designed complex forming valuable redundant into different hyperedges. In addition, construction update method designed. For constructing running as vertices cosine similarity metric measure correlation them. Vertex hyperedge mechanisms are introduced dynamic realize alternating Quantitative qualitative experiments RSICD RSITMD datasets verify effectiveness cross-modal
منابع مشابه
Learning Hypergraph Labeling for Feature Matching
This study poses the feature correspondence problem as a hypergraph node labeling problem. Candidate feature matches and their subsets (usually of size larger than two) are considered to be the nodes and hyperedges of a hypergraph. A hypergraph labeling algorithm, which models the subset-wise interaction by an undirected graphical model, is applied to label the nodes (feature correspondences) a...
متن کاملHypergraph Based Visual Segmentation and Retrieval
This chapter explores original techniques for the construction of hypergraph models for computer vision applications. A hypergraph is a generalization of a pairwise simple graph, where an edge can connect any number of vertices. The expressive power of the hypergraph models places a special emphasis on the relationship among three or more objects, which has made hypergraphs better models of cho...
متن کاملHypergraph-based image retrieval for graph-based representation
In this paper, we introduce a novel method for graph indexing. We propose a hypergraph-based model for graph data sets by allowing cluster overlapping. More precisely, in this representation one graph can be assigned to more than one cluster. Using the concept of the graph median and a given threshold, the proposed algorithm detects automatically the number of classes in the graph database. We ...
متن کاملIntegrating Textual and Visual Information for Cross-Language Image Retrieval
This paper explores the integration of textual and visual information for cross-language image retrieval. An approach which automatically transforms textual queries into visual representations is proposed. The relationships between text and images are mined. We employ the mined relationships to construct visual queries from textual ones. The retrieval results of textual and visual queries are c...
متن کاملHypergraph-Based Image Representation
An appropriate image representation induces some good image treatment algorithms. Hypergraph theory is a theory of finite combinatorial sets, modeling a lot of problems of operational research and combinatorial optimization. Hypergraphs are now used in many domains such as chemistry, engineering and image processing. We present an overview of a hypergraph-based picture representation giving muc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
سال: 2023
ISSN: ['2151-1535', '1939-1404']
DOI: https://doi.org/10.1109/jstars.2022.3226325